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IMPROVED VERSION OP THE SPLIT ROUTINE FOR CLASSY 


Introduction 

The SPLIT routine originally Implemented for CLASSY was 
Intended to be a quickly written, temporary routine to get the 
system running. In fact, another version had been designed and 
coded earlier, but was Judged too difficult to debug quickly. 

The original version of SPLIT attempted to minimize a 
quadratic form of the difference between the observed skewness 
and kurtosls and the skewness and kurtosis produced by the 
model. The model was then adjusted by a kind of steepest des- 
cents approach until further adjustment would not prove practi- 
cal. This frequently requires 50 or more iterations, which can 
consume a fairly large amount of machine time. In fact, with 
other speedups of CLASSY being implemented by Elogic, SPLIT 
could well become the primary consumer of machine time. 

In addition, analyses done since the original SPLIT was 
encoded indicate that it may not give the best solution for 
common situations. Thus a new SPLIT routine would have to pro- 
duce a better set of solutions than the original version. 

For both of the above reasons, Elogic designed a new 
SPLIT routine which is being implemented by Lockheed. 
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Calculation Procedure 

We wish to find approximate values for the parameters 
(mean, covariance, and proportion) of two distributions, given 
the same parameters plus the traces of skewness and kurtosls 
for the mixture distribution. The calculation is done in a 
coordinate frame where the mixture covariance is the unit 
matrix, which allows us to easily calculate inner products 
with respect to that covariance, etc. 

The two clusters to be found are defined by 

cluster 

proportion a 3 

mean y v 

covariance 

We are given S (the skewness vector S^=(x^Xj^) ); and the 
input kurtosls matrix K^, which is Immediately used to derive 
the practical kurtosls matrix, K=K^-(d+2)E, where d is the 
dimensionality of the space. 

The initial variables are obtained as in the present 
SPLIT routine, and are used to set up the new clusters as in 
that routine. The reader is referred to the documentation for 
SPLIT Version I for the details of these calculations. In 
addition the rotation to the frame with unit covariance and the 
initialization of the iterations is done as in Version I. 
Version II differs from Version I only in the form of the 
iterative step and convergence calculation, and in the handling 
of certain special conditions. 
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We wish to solve the equations 


■ - 66^ + = 1 (covariance — a symmetric 

matrix, 1 means the 
unit matrix) 

(la) 

(2D^6 + 6 trD^-Y^^5) = S (skewness — a vector) 

(lb) 


(2D'' + trD^ + 525 6^ _ 


(Ic) 


2y 


(D^a) 6^+6 (D^6)^ 


ya^D^ - Y trD^6 6)^ = K 

(kurtosis, (d+2)E" 
removed; a symmetric 
matrix) 


where 


-1<Y£1 is the mixing parameter 

6 is the displacement vector between the clusters (=p-v) 

(2a) 

d2 = a^-T^ • (2b) 


The unknowns are a^, 6, and y. 

This system has one more unknown than equations, requir- 
ing an additional assumption. The simplest such assumption is 
Y=0; other possibilities are trD^=0, etc. We will use 
(equal weight clusters), here, but some hueristic allowance 
must be made at the end of the calculation to avoid wild solu- 
tions. Some checking of solutions must be done in any event to 
avoid problems caused by multiple clusters, etc. 


The equations to be solved become 


^ (2D^6 + 6 trD^) = S (3a) 

I (2D‘» + D^trD^ - ^^t) ^ ^ (where 6^ = 26^^) (3b) 

which are to be solved for 6 and , (Note that and appear 
here only via , ) The definition of , along with the equation 

^ 56^ + I + I = 1, (3c) 

can be treated separately and later than (3a) and (3b) except for 
the positivity constraint on and . This requirement is 
handled as a special constraint during the solution of the system 
(2b) and (3c )j If no solution is possible, the error solution is 
taken. 

Under normal circum.stances , the matrix K will have exactly 
one negative eigenvalue. If K has no negative eigenvalues, 
this Indicates that the two hypothesized clusters differ primar- 
ily In covariance (a condition which might reasonably be flagged). 
A situation where K has multiple negative eigenvalues can be 
contrived with only two clusters, but this generally indicates 
multiple clusters, requiring a more even division of the space 
for later additional SPLITting. 

Additional Conditions 

Besides the actual numeric solution, the following special 
conditions must be observed: 
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1) If there are excess negative eigenvalues helow a 
certain threshold, probably indicating multiple subclusters, 
the "basic" (initial) solution is used. A message condition 
is also raised. 

2) If the solution discovered has covariance matrices 
which v;ould divide the space too unevenly, then the "basic" 
solution is also used, with a message condition. 

3) The "basic" solution and a third' message condition 
are used if the numerical iteration falls to converge. 

The "basic" solution is a hueristlcally calculated solu- 
tion to the equations which is used as the initial solution and 
as a last resort solution in the case of errors. It is in fact 
the initial solution for the original SPLIT routine. 
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Iteration Procedure for Version II of SPLIT 

Write AS - ^ (2D*6 + 5 trD^) - S 

AK « ^ (2D** + tS't^ - I 6®66^) - K (4b) 

for the solution errors. (These will be 0 if 6 and are 
correct.) The initial values are determined by the "basic" 
procedure, described above. The iterations are done in a 
frame where is constantly forced to be a diagonal matrix — 

that is, if is corrected by an off-diagonal term, the coordi- 

nate frame is rotated to make be diagonal again. (This 
requires rotating D^, 5, K, and S.) 

Thus the frame in whici'i the problem is stated rotates from 
iteration to iteration, overall from one in which K is diagonal 
to one in which the final solution is diagonal. These 
rotations must be stored in a separate rotation matrix which 
gives the net rotation between the transformed frame and the 
original one. When making the rotations, 6 and are both 
rotated incrementally to the new frame, and S and K are rotated 
from their original values t j> the new frame by the accumulated 
transformation before AS and AK are calculated. 

The variables are separated into two groups; 

1) 5 and the diagonal elements of 

2) the off-diagonal elements of 
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The variables of Group 1 are modified during iteration 
using a matrix version of Newton’s method; those in Group 2 
are modified by direct substitution (Altken iteration), 
implemented by frame rotation. 

Iteration on 6 and the dlaKonal elements of ; 

We take AS and the diagonals of AK and use them to cor- 
rect 6 and the diagonals of , via 



(2dx2d mtx) 


Since this is a Newton’s method technique, it is subject 
to bad overshoots, etc. Therefore, a test must be made that the 
various error terms AS and diag AK decrease due to the iteration, 
if they do not, the increment given to 6 and diag can be 
decreased. (For small enough increment, they are guaranteed to 
decrease.) This test may be applied to the whole iteration, not 
just the Newton’s method part; this would save the need to make 
duplicate calculations of AS and AK. In the worst case of non- 
convergence, the ’’basic” solution could be taken, with the rais- 
ing of an error flag. 
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The various derivatives are; 


3AS 


i _1 
k ^ 


D' 


ik ^ °ik 


6.,.trD‘ 


(5a) is a Kronecker delta function, 

=1 if i=k, else 0; not to be con- 
fused with 6^ the vector, which 

has 1 or no subscripts, instead 
of 2) 


3AS 


8D‘ 


L- = 

\a 


1 A 


^.'^ki 






(5b) 


3AK 


■M. = I 6, 6, ,6, - i 6 ^, . 6 . 6 =^ - t 6 ., 6 . 6 ^ 


^ T °i°k°j - F °ik°J°" " F °Jk°i°" '^i^i^ ^50 


9AK. j n n 

k5, 


+ 





(5<a) 


Although these equations include the off-diagonal terms 
of AK and D^, only the diagonal terms are used in the Newton’s 
method matrix,, 

t 

Off diagonal elements 

The off diagonal elements of are adjusted using direct 
substitution. The off diagonal components of AK are used to 
determine the change in required. The amount of change 
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required can be determined from For i?^j , Wz, 


3AK 


3D 


i.1 _ 1 

2 ” ? 




J °jk 


( 6 ) 


Note that D^ is diagonal (in the current coordinate frame) so 
that either is=k and or i=£ and J=k for the derivative to be 
non-zero. By symmetry conditions on AK and D^, we may assume 
i>J and k>A, i=k and J=ii so that we calculate 



D 


kj^ ] 
new j 

■^off diagonal 


= -2 


2D' 


kk 


+ trD' 


-1 


AK 


kZ 


The two-part calculation of 6 and D^ is basically a Newton's 
method calculation with the derivative terms between the parts 


treated as 0. 
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Final Solution 

After and t have been calculated, the program must 
check that the variances and are satisfactory. This is 
done via the relations 

1 = 

0 ^ and may be calculated from these, and checked for posi- 
tive definiteness. Actually, and must be held fairly far 
away from zero eigenvalues and must not have too large or small 
a volume ratio. The actual threshold for these tests is under 
the control of parameters. If these tests fail, we revert to 
the "basic” solution (this is error 2 above). 

After checking and x^ , the system must be rotated back 
to the original frame of reference by the Inverse of the accu- 
mulated transforms of . These back-transformed 6, a^, and x^ 
can then be used to generate the pair of new clusters. 




SPLIT VERSION 2 PLOW CHART 
(includes subroutine names) 
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Summary 

This report has developed the theory for a new, improved 
version of the SPLIT routine. This version is intended to 
replace the old version which was put together quickly, without 
using any fundamental understanding of the problem. 
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